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Introduction 


Objectives: 

This  research  has  the  following  main  aims. 

1  To  develop  and  implement  a  fuzzy  object  definition  method  for  the  detection  and 
delineation  of  parenchymal  density,  masses  and  microcalcifications  in  digitized 
mammograms. 

2  To  develop  and  implement  a  fuzzy  object  definition  method  for  the  classification  of 
lesions  and  mammographic  densities. 

3  To  conduct  evaluation  studies  using  histologically  verified  mammographic  data  to 
determine  the  efficacy  of  the  proposed  methods  of  lesion  detection  and  quantitative 
classifications. 

The  focus  during  the  report  period  has  been  on  the  following  three  tasks: 

1  Select  data  sets  required  for  Technical  Objective  3,  digitize  them,  transmit  via  PACS 
to  MEPG  facilities  and  store  them  in  auxiliary  storage  medium  (optical  disks)  for  later 
use.  Recruit  Post-Doctoral  Fellow. 

2  Modify  current  implementation  to  handle  vector-valued  features  and  multivariate 
affinity  functions. 

3  Implement  lesion  detection  method,  verify  effectiveness  with  a  few  sample  data  sets 
and  refine  method  if  needed. 

Purpose: 

Existing  studies  on  false  negative  mammograms  have  found  patient  age,  tumor  histology 

and  interpretive  variability  to  contribute  to  false  negative  diagnosis.  However,  breast 

density  appears  to  be  the  primary  cause  of  missed  carcinomas.  The  radiographic 

appearance  of  female  breast  differs  from  woman  to  woman  in  relation  to  the  amounts  of 
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fat  and  fibroglandular  (connective  and  epithelial)  tissue  present.  Areas  of  fat  are 
radiographically  lucent  while  fibroglandular  tissues  are  radiographically  dense.  There 
have  been  many  studies  looking  at  the  relationship  between  mammographic  density  and 
risk  of  developing  breast  cancer.  Although  a  few  studies  reported  no  association  with 
increased  risk,  the  majority  of  studies  have  found  an  association  between  parenchymal 
patterns  and  breast  cancer  risk.  A  recent  meta-analysis  [1]  of  all  studies  confirms  that 
subjects  with  mammographic  densities  have  an  increased  risk  of  breast  cancer  relative  to 
those  without  densities.  The  risk  increases  with  the  density  of  the  breast  [2].  The  Wolfe 
classification  was  proposed  many  years  ago  to  identify  groups  of  woman  at  high  risk  for 
breast  cancer  [3].  This  scheme  was  widely  used  for  many  years,  but  has  fallen  into  disuse 
because  of  several  limitations.  For  example,  inter-observer  variability  is  a  problem  when 
the  radiologists’  subjective  assessment  is  used  to  classify  the  amount  of  density  present 
[4].  Secondly,  the  magnitude  of  the  increased  risk  has  varied  widely  in  the  published 
studies  [1].  Thirdly,  identification  of  this  risk  factor  for  a  given  woman  has  not  altered 
screening  recommendations  [5,6].  Recently  an  computer-assisted  user-interactive  method 
to  quantify  mammographic  density  has  been  published  [5],  which  concluded  that 
quantitative  classification  of  densities  allows  for  more  specific  gradients  of  risk  than  do 
Wolfe’s  classifications. 

An  objective,  repeatable  quantitative  measure  of  risk  derived  from  mammographic 
densities  will  be  useful  in  recommending  alternative  screening  process.  An  architecture 
dependent  quantitative  analysis  of  the  mammographic  densities  will  make  the  screening 
process  more  effective.  Image  processing  efforts  toward  this  goal  seem  to  be  very  sparse 
in  the  literature,  and  automatic  and  efficient  methods  for  generating  this  measure  do  not 
seem  to  exist. 

The  focus  of  this  research  is  on  utilizing  and  extending  recently  developed  fuzzy 
connectedness  method  [17]  to  fulfill  the  main  objectives.  This  methodology  has  been 
successfully  applied  in  several  applications  including  multiple  sclerosis  lesion  detection 
[7-11],  MR  angiography  [12]  and  craniofacial  soft  tissue  imaging  [13].  The  approach  of 
integrating  density-derived  information  (total  density  and  architecture)  into  the  lesion 
detection  method  will  hopefully  further  improve  this  accuracy. 
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Scope: 

Computer-assisted  analysis  of  mammographic  density  would  provide  an  objective, 
quantitative  measure  of  cancer  risk  factor.  This  measure  will  be  useful  in  total  risk 
analysis  in  several  ways.  First,  such  risk  analysis  could  influence  the  choice  of  alternative 
screening  paradigms  such  as  intervals  between  mammograms  or  use  of  other  modalities 
such  as  MRI.  Second,  this  measure  could  be  useful  in  selecting  a  group  of  women  for 
whom  the  risk-benefit  ratio  of  a  potentially  toxic  preventive  measure,  such  as  tamoxifin, 
would  be  favorable  [14,15].  Third,  this  measure  could  be  used  to  signal  the  need  for  more 
careful  interpretation  of  a  subset  of  mammograms.  For  example,  double-reading  might  be 
indicated  for  mammograms  above  a  certain  level  of  density.  Fourth,  a  variety  of 
computer-assisted  techniques  continue  to  be  developed  for  mammographic  lesion 
detection.  No  single  algorithm  is  optimal  for  all  mammograms.  Objective  density 
quantification  could  be  helpful  for  selecting  the  most  appropriate  computerized  method, 
or  as  we  propose  here,  for  tailoring  a  selected  algorithm.  Clearly,  an  automatic,  accurate, 
and  objective  method  for  density  quantification  will  allow  the  study  of  the  effect  of  this 
variable  on  the  implementation  of  mammograms. 


Methods 


Task  1: 

A  postdoctoral  fellow  has  been  recruited  for  this  project.  This  research  utilizes  only 
existing  image  data.  No  patients  are  imaged  expressly  for  the  purpose  of  this  project.  We 
have  so  far  digitized  (at  a  resolution  of  100  microns)  existing  mammograms  which 
consists  of  39  normal  cases,  43  masses  and  42  calcifications  from  our  patient  database  in 
the  hospital.  These  images  have  been  transmitted  to  our  (MIPG)  facility,  converted  to  our 
local  format  (the  IMO  format  of  3DVIEWNIX  [18],  the  software  platform  used  in  the 
project),  and  stored  on  a  medium.  We  continue  to  get  more  digital  data. 
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Task  2: 


We  have  spent  a  considerable  amount  of  time  investigating  the  affinity  relation  that  are 
appropriate  for  segmenting  digitized  mammograms.  A  novel  scale-based  fuzzy  affinity 
and  connectedness  method  has  been  developed,  implemented  in  3DVIEWNIX  [18]  and 
tested.  A  manuscript  has  been  submitted  [16]  on  this  topic.  A  brief  summary  of  its 
approach  is  given  below. 

Fuzzy  affinity  between  two  nearby  pixels  is  a  reflexive  and  symmetric  relation  whose 
strength  lies  between  0  and  1  and  indicates  how  the  pixels  locally  “hang  together”  in  the 
image.  The  notion  of  fuzzy  affinity  is  expressed  as  a  nondecreasing  function  of  fuzzy 
adjacency  (dependent  only  on  the  distance  between  pixels),  their  homogeneity  and  their 
agreement  to  some  global  intensity-based  object  property  or  feature.  In  determining  the 
homogeneity  and  feature-based  components  of  affinity  between  two  pixels  p\  and  p2  ,  a 
neighborhood  around  both  p\  and  p2  are  considered.  The  size  of  this  neighborhood, 
called  scale,  is  not  fixed  but  depends  on  the  size  of  the  largest  homogeneous  region 
locally.  The  scale  is  first  computed  automatically  for  all  pixel  in  the  image. 

Fuzzy  connectedness  between  any  two  (not  necessarily  nearby)  pixels  p\  and  p2  is  a 
fuzzy  relation  whose  strength  lies  between  0  and  1.  It  is  determined  by  considering  all 
possible  paths  between  p\  and  p2  .  A  path  is  simply  a  sequence  of  nearby  pixels.  A 
strength  of  connectedness  is  assigned  to  each  path  which  is  simply  the  smallest  affinity  of 
successive  pixels  along  the  path.  The  strength  of  connectedness  between  p\  and  p2  is  the 
largest  of  the  strengths  of  all  possible  paths  between  p\  and  p2  . 

In  one  way,  we  can  imagine  that  two  pixels  are  strongly  connected  to  each  other  if  there 
is  a  path  between  them  through  locally  homogeneous  regions.  We  call  it  homogeneity 
based  connectedness.  In  another  way,  the  connectedness  between  two  pixels  could  be 
imagined  as  their  likeliness  to  fit  the  object  features  and  unlikeliness  to  fit  the  background 
features.  We  call  it  feature  based  connectedness.  One  may  note  that  feature  based 
connectedness  does  not  have  the  notion  of  path  homogeneity  and  thus  fails  to  overcome 
the  effects  of  slowly  varying  components  over  the  image.  On  the  other  hand, 
homogeneity  based  connectedness  suffers  from  the  fact  that  in  many  applications  objects 
merge  to  background  so  smoothly  that  always  there  is  a  path  from  object  to  background 
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through  locally  homogeneous  regions;  this  is  more  pronounced  when  blurring  or  partial 
voluming  is  high.  Moreover,  especially  in  a  thin  branch  of  an  object,  often,  there  are 
small  regions  with  high  inhomogeneity  that  stop  the  homogeneity  based  paths  for  the  rest 
of  the  branch.  Therefore,  the  two  different  notions  of  connectedness  are  combined. 
Additionally,  the  notion  of  scale  allows  the  correct  estimation  of  homogeneity  and  object 
features  that  is  insensitive  to  noise. 

A  detailed  mathematical  formulation  of  scale-based  fuzzy  affinity,  connectedness,  the 
associated  algorithms,  and  their  validation  on  both  2D  and  3D  clinical  images  and 
phantoms  is  presented  in  [16].  A  careful  statistical  evaluation  indicates  that  the  scale- 
based  method  is  much  superior  to  the  original  method  described  in  [17] 

Task  3: 

We  have  devised  and  implemented  several  methods  for  characterizing  abnormal 
parenchymal  architecture  using  the  scaled-based  connectedness.  These  are  currently 
being  tested  and  evaluated  for  their  efficacy  in  identifying  and  characterizing  lesions. 

In  addition,  a  major  part  of  Task  6  (stated  in  the  proposal)  has  been  completed.  This  task 
has  to  do  with  developing  a  method  to  automatically  quantify  parenchymal  density  and  to 
validate  it.  We  felt  it  was  more  natural  to  accomplish  this  task,  once  the  fuzzy 
connectedness  method  is  in  plane.  Besides,  we  also  felt  that  the  segmentation  and 
quantification  of  parenchymal  density  would  be  useful  in  lesion  detection  and 
characterization.  A  brief  summary  of  this  work  is  given  below.  A  manuscript  on  this 
work  is  now  completed  [20]  and  being  readied  for  journal  submission. 

Segmentation  of  breast  from  background:  At  the  very  beginning,  using  3DVIEWNIX 
[18]  supported  LIVE-WIRE  [19]  tool,  regions  corresponding  to  pectoral  muscles  are 
interactively  excluded  when  those  are  projected  in  the  image.  In  the  entire  process,  this  is 
the  only  step  requiring  operator  intervention.  Fuzzy  connectivity  is  used  as  the  underlying 
technique  in  segmenting  breast  from  background.  To  apply  the  fuzzy  connectivity  model, 
we  need  to  estimate  different  parameters.  Studying  all  the  120  images  from  all  the  60 
patients,  we  found  that  the  intensity  histogram  always  contains  a  highly  prominent  peak 
at  the  lower  intensities;  and  that  peak  is  mostly  contributed  by  background.  The  first 
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prominent  peak  in  the  intensity  histogram  is  detected  and  used  to  (roughly)  calculate 
mean  and  standard  deviation  of  background  intensity.  To  apply  the  fuzzy  connectivity 
algorithm,  we  need  to  select  a  set  of  reference  (seed)  pixels.  For  this  purpose,  we  assume 
that  the  rightmost  column  in  the  image  always  lies  in  background  and  include  all  these 
pixels  as  the  reference  set.  Fuzzy  connectedness  processing  starting  from  those  pixels 
gives  us  a  fuzzy  connectivity  image  for  background.  We  discard  connectivity  strengths  in 
the  upper  half  and  keep  the  lower  half  as  the  breast  region.  Figures  (a)  and  (b)  show  an 
original  digitized  mammogram  and  the  segmented  breast. 

Fuzzy  connectivity  image  for  glandular  tissue:  Fuzzy  connectivity  method  is  used  to 
enhance  glandular  dense  regions  and  to  suppress  fat  tissues;  the  resulting  fuzzy 
connectivity  image,  in  turn,  is  used  for  automatic  segmentation  of  glandular  region  from 
fat.  The  major  task  in  applying  the  fuzzy  connectivity  model  is  to  estimate  the  parameters 
of  the  algorithm  and  to  select  the  set  of  reference  pixels.  After  ignoring  the  upper  0.01 
percentile  intensities,  the  mean  and  standard  deviation  parameters  (for  the  homogeneity 
and  feature-based  affinity)  are  estimated  from  those  part  of  the  breast  region  falling  in  the 
upper  25%  of  the  intensity  range.  Finally,  the  pixels  in  the  breast  region  falling  in  the 
upper  15%  of  the  intensity  range  are  selected  as  reference  pixels.  The  fuzzy  connected 
image  for  the  glandular  tissue  is  computed  (Figure ).  In  the  next  paragraph  we  describe  an 
automatic  threshold  selection  method  for  quantifying  density  from  this  image. 

Automatic  threshold  selection:  For  any  threshold,  the  image  is  divided  into  two  regions. 
Local  homogeneity  based  affinity  between  every  pair  of  spatially  adjacent  pixels  is  used 
to  define  their  likeliness  of  belonging  to  the  same  object  or  of  not  belonging  to  the  same 
object.  The  optimum  threshold  is  determined  from  the  associated  statistics  called 
threshold  energy.  The  threshold  with  the  minimum  threshold  energy  is  selected  as  the 
optimum  threshold.  The  segmented  binary  image  for  our  example  is  shown  in  Figure  (d). 

Finally,  we  generate  several  descriptors  from  the  segmented  and  the  original  image  to 
quantify  the  glandular  tissues  as  described  below.  Note  that  all  steps  are  completely 
automatic  except  the  exclusion  of  pectoral  muscles  if  they  are  included  in  the 
mammogram. 


10 


Results 


The  method  has  been  tested  on  over  60  studies  (each  study  produces  two  digitized 
mammograms)  from  routine  exams  from  two  projections  (CC  and  MLO).  The  images 
were  scanned  on  a  Lumisys  scanner  at  a  resolution  of  100  microns.  The  population 
included  normal  as  well  as  cancer  cases  (masses  and  calcifications).  Except  the  exclusion 
of  pectoral  muscles,  the  entire  method  has  worked  automatically  on  all  images  wherein 
all  parameters  required  by  the  algorithms  are  selected  automatically.  The  algorithms 
produced  visually  acceptable  segmentations  in  all  120  images.  From  the  segmented 
regions  and  the  image  intensities  in  them,  we  compute  a  set  of  density  related  parameters 
including  total  glandularity(TG),  TG/total  fat(TF),  TG/average  fat(AvF),  TG/area  of 
breast(AB),  area  of  glandularity(AG),  AG/area  of  fat(AF),  AG/AB.  TG  and  TF  are 
computed  by  integrating  radiographic  intensity  over  respective  segmented  regions  while 
AG,  AB  and  AF  are  computed  by  counting  the  number  of  pixels  in  respective  regions; 
finally,  AvF  is  computed  by  dividing  TF  by  AF.  The  correlation  between  the  parameters 
from  the  two  projections  (CC  and  MLO)  is  studied.  The  correlations  for  TG,  TG/TF, 
TG/AvF,  TG/AB,  AG,  AG/AF  and  AG/AB  are  0.967,  0.902,  0.951,  0.944,  0.959,  0.915 
and  0.941,  respectively;  the  plots  are  shown  in  Figures  2.(a)-(g).  In  all  figures,  the 
horizontal  axis  represents  the  estimated  parameter  from  the  CC  projection  while  the 
vertical  axis  represents  the  estimate  from  the  MLO  projection. 


Conclusions 


1  The  new  scale-based  fuzzy  connectedness  method  is  more  robust  and  effective  than 
the  original  method  [17].  It  is  very  effective  for  mammographic  image  segmentation. 

2  The  method  seems  to  be  appropriate  for  characterizing  architectural  abnormalities  and 
for  lesion  detection.  Its  evaluation  for  these  purpose  is  currently  underway. 
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3  Glandularity  is  considered  to  be  one  of  the  strongest  factors  for  breast  cancer. 
Automatic  breast  glandularity  quantification  from  digitized  mammograms  is  practical 
using  the  proposed  method.  The  method  removes  the  subjectivity  inherent  in 
interactive  threshold  selection  techniques  currently  used.  The  ability  of  the  computed 
glandularity  parameters  in  evaluating  risk  is  currently  being  investigated. 
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Appendix 


Figure  Captions: 

Figure  1:  (a)  A  digitized  mammographic  image,  (b)  Segmented  breast  region,  (c) 
Connectivity  image  for  glandular  tissue,  (d)  Segmented  dense  regions. 


Figure  2:  Graphical  representations  for  different  parameters  computed  from  two  different 
projections  (CC  and  MLO).  In  all  graphs,  the  x-value  represents  the  estimated  parameter 
from  CC  projection  while  the  y-value  represents  the  estimated  parameter  from  MLO 
projection,  (a)  Total  glandularity(TG).  (b)  TG/total  fat(TF).  (c)  TG/average  fat(AvF)  (d) 
TG/area  of  breast(AB).  (e)  Area  of  glandularity(AG).  (f)  AG/area  of  fat(AF).  (g)  AG/AB. 
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Figure  2.(c) 
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